Using active learning selection approach for cross-project software defect prediction
نویسندگان
چکیده
Cross-project defect prediction (CPDP) technology can effectively ensure software quality, which plays an important role in engineering. When encountering a newly developed project with insufficient training data, CPDP be used to build predictors using other projects. However, does not take into account the prior knowledge of target items and class imbalance source item data. In this paper, we design active learning selection algorithm for cross-project alleviate above problems. First, use clustering algorithms filter label some representative data from these as guide items. Then, is Finally, balanced cross-item dataset constructed algorithm, model built. article, selected 10 open-source projects by common models, algorithms, evaluation metrics. The results show that proposed solve problem improve performance.
منابع مشابه
Cross-project defect prediction
Prediction of software defects works well within projects as long as there is a sufficient amount of data available to train any models. However, this is rarely the case for new software projects and for many companies. So far, only a few have studies focused on transferring prediction models from one project to another. In this paper, we study cross-project defect prediction models on a large ...
متن کاملTransfer learning for cross-company software defect prediction
0950-5849/$ see front matter 2011 Elsevier B.V. A doi:10.1016/j.infsof.2011.09.007 ⇑ Corresponding author. Tel.: +86 028 61830557; fa E-mail addresses: [email protected] (Y. Ma), g [email protected] (X. Zeng), [email protected] Context: Software defect prediction studies usually built models using within-company data, but very few focused on the prediction models trained with cross-company da...
متن کاملAnalysis of Software Project Reports for Defect Prediction Using KNN
Defect severity assessment is highly essential for the software practitioners so that they can focus their attention and resources on the defects having a higher priority than the other defects. This would directly impact resource allocation and planning of subsequent defect fixing activities. In this paper, we intend to predict a model which will be used to assign a severity level to each of t...
متن کاملOn Software Defect Prediction Using Machine Learning
The goal of this paper is to catalog the software defect prediction using machine learning. Over the last few years, the eld of software defect prediction has been extensively studied because of it's crucial position in the area of software reliability maintenance, software cost estimation and software quality assurance. An insurmountable problem associated with software defect prediction is th...
متن کاملSoftware Defect Prediction Using Ensemble Learning Survey
Machine learning is a science that explores the building and study of algorithms that can learn from the data. Machine learning process is the union of statistics and artificial intelligence and is closely related to computational statistics. Machine learning takes decisions based on the qualities of the studied data using statistics and adding more advanced artificial intelligence heuristics a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Connection science
سال: 2022
ISSN: ['0954-0091', '1360-0494']
DOI: https://doi.org/10.1080/09540091.2022.2077913